A Named Entity Recognizer for Filipino Texts
نویسندگان
چکیده
In this paper, we define the task of named entity recognition, look at existing systems for named entity recognition, and discuss the design, implementation, and evaluation of a system that performs named entity recognition on Filipino texts. We also compare the results of the system with an existing named entity recognizer designed for English texts using a Filipino corpus.
منابع مشابه
Czech Named Entity Corpus and SVM-based Recognizer
This paper deals with recognition of named entities in Czech texts. We present a recently released corpus of Czech sentences with manually annotated named entities, in which a rich two-level classification scheme was used. There are around 6000 sentences in the corpus with roughly 33000 marked named entity instances. We use the data for training and evaluating a named entity recognizer based on...
متن کاملNamed Entity Recognition in Greek Texts with an Ensemble of SVMs and Active Learning
We present a freely available named-entity recognizer for Greek texts that identifies temporal expressions, person, and organization names. For temporal expressions, it relies on semi-automatically produced patterns. For person and organization names, it employs an ensemble of Support Vector Machines that scan the input text in two passes. The ensemble is trained using active learning, whereby ...
متن کاملNEROC: Named Entity Recognizer of Chemicals
We describe a pipeline system, Named Entity Recognizer of Chemicals (NEROC), that aims to identify chemical entities mentioned in free texts. The system is based on a machine learning approach, a Conditional Random Field (CRF), and a selection of feature sets that are used to capture specific characteristics of chemical named entities. In this paper, we report results that produced by CRF model...
متن کاملNamed Entity Recognition in Greek Texts
In this paper, we describe work in progress for the development of a named entity recognizer for Greek. The system aims at information extraction applications where large scale text processing is needed. Speed of analysis, system robustness, and results accuracy have been the basic guidelines for the system’s design. Our system is an automated pipeline of linguistic components for Greek text pr...
متن کاملTowards automatic recognition of product names: an exploratory study of brand names in economic texts
This paper describes the first stage of research towards automatic recognition of brand names (trademarks, product names and service names) in Swedish economic texts. The findings of an exploratory study of brand names in economic texts by Malmgren (2004) are summarized, and the work of compiling a corpus annotated with named entities based on these findings is described. A Named Entity Recogni...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007